Coupled Hierarchical IR and Stochastic Models for Surface Information Extraction

Authors

  • Hugo Zaragoza
  • Patrick Gallinari
Abstract

We present in this paper a combination of Machine Learning based Information Retrieval (IR) techniques and stochastic language modelling in a hierarchical system that extracts surface information from text. At the lowest level of this hierarchy, documents and paragraphs are successively routed with IR techniques. At the top level, a stochastic language model extracts the most relevant phrases, and labels the type of information they contain. The approach and preliminary results are demonstrated on a subset of the MUC-6 Scenario Templates task.
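The routing cascade described in the abstract can be pictured with a minimal sketch. It is an illustration under stated assumptions, not the authors' system: the abstract mentions machine-learning-based IR, whereas this sketch substitutes a plain term-frequency cosine score, and the names `tokenize`, `cosine`, `route`, `cascade`, the query terms, and both thresholds are hypothetical. The final stage (the stochastic language model that extracts and labels phrases) is not sketched here.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(query_terms, text):
    """Cosine similarity between raw term-frequency vectors (a stand-in scorer)."""
    q = Counter(query_terms)
    d = Counter(tokenize(text))
    dot = sum(q[t] * d[t] for t in q)
    q_norm = math.sqrt(sum(v * v for v in q.values()))
    d_norm = math.sqrt(sum(v * v for v in d.values()))
    norm = q_norm * d_norm
    return dot / norm if norm else 0.0


def route(units, query_terms, threshold):
    """Keep only the text units whose score reaches the threshold."""
    return [u for u in units if cosine(query_terms, u) >= threshold]


def cascade(documents, query_terms, doc_threshold=0.1, par_threshold=0.1):
    """Route documents first, then the paragraphs of the surviving documents."""
    kept_docs = route(documents, query_terms, doc_threshold)
    paragraphs = [
        p.strip() for doc in kept_docs for p in doc.split("\n\n") if p.strip()
    ]
    # The surviving paragraphs would be handed to the phrase-level model.
    return route(paragraphs, query_terms, par_threshold)
```

The point of the layering is that each stage discards material cheaply, so whatever runs last sees only a small fraction of the corpus.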

Similar resources

An Automatic Surface Information Extraction System Using Hierarchical

We address the problem of constructing an automated information extraction system from a corpus of text. We present in this paper a novel approach to textual information extraction (IE) combining information retrieval (IR) techniques and stochastic language modelling in a hierarchical system. This largely reduces the amount of data treated with complex (slow) analysis systems, it allo...

Extraction of Some Divalent Metal Ions (Cadmium, Nickel and Lead) from Different Tea and Rice Samples Using Ghezeljeh Nanoclay (Geleh-Sar-Shoor) as a New Natural Sorbent

This article presents a method for the extraction-preconcentration of lead, cadmium, and nickel ions from food samples using the Ghezeljeh montmorillonite nanoclay (Geleh-Sar-Shoor) as a new native adsorbent in batch single component systems. The extraction-preconcentration of heavy metals was carried out by applying the solid phase extraction (SPE) method followed by atomic abs...

Application of Stochastic Optimal Control, Game Theory and Information Fusion for Cyber Defense Modelling

The present paper addresses an effective cyber defense model by applying information fusion based game theoretical approaches. In the present paper, we are trying to improve previous models by applying stochastic optimal control and robust optimization techniques. Jump processes are applied to model different and complex situations in cyber games. Applying jump processes we propose some m...

Screening and Optimization of Microextraction of Pb(II) by Inductively Coupled Plasma-Atomic Emission Using Response Surface Methodology

Dispersive liquid–liquid microextraction (DLLME) combined with inductively coupled plasma-atomic emission spectrometry (ICP-AES) was applied for the determination of lead in different environmental water samples. Ammonium pyrrolidine dithiocarbamate (APDC), chloroform and ethanol were used as chelating agent, extraction solvent and disperser solvent, respectively. The effective parameters, such...

Learning for Sequence Extraction Tasks

We consider the application of machine learning techniques for sequence modeling to Information Retrieval (IR) and surface Information Extraction (IE) tasks. We introduce a generic sequence model and show how it can be used for dealing with different closed-query tasks. Taking into account the sequential nature of texts allows for a finer analysis than what is usually done in IR with static tex...

Publication date: 1998